Principled Monitoring of Distributed Agents for Detection of Coordination Failure
نویسندگان
چکیده
There is a very rich variety of systems of autonomous agents, be it software or robotic agents. In particular, multi-agent systems can include agents that may be part of a team and need to coordinate their actions during their distributed task execution. This coordination requires an agent to observe, i.e., to monitor, the other agents in order to detect a possible coordination failure of the team. Several researchers have addressed the problem of monitoring for single or multiple agent systems and have contributed successful, but mainly application-specific, approaches. In this paper, we aim at contributing a unifying, domain-independent statement of the distributed multi-agent monitoring problem. We define the problem in terms of a pre-defined desirable joint state and an observation-state mapping. Given a concrete joint observation during execution, we show how an agent can detect a possible coordination failure by processing the observation-state mapping and the desirable joint state. To illustrate the generality of our formalism, one of the main contributions of the paper, we represent several previously studied examples within our formalism. We note that basic failure detection algorithms can be computationally expensive. We further contribute an efficient method for failure detection that builds upon an off-line compilation of the principled relations introduced. We show empirical results that demonstrate this effectiveness.
منابع مشابه
ENERGY AWARE DISTRIBUTED PARTITIONING DETECTION AND CONNECTIVITY RESTORATION ALGORITHM IN WIRELESS SENSOR NETWORKS
Mobile sensor networks rely heavily on inter-sensor connectivity for collection of data. Nodes in these networks monitor different regions of an area of interest and collectively present a global overview of some monitored activities or phenomena. A failure of a sensor leads to loss of connectivity and may cause partitioning of the network into disjoint segments. A number of approaches have be...
متن کاملRobust Agent Teams via Socially-Attentive Monitoring
Agents in dynamic multi-agent environments must monitor their peers to execute individual and group plans. A key open question is how much monitoring of other agents' states is required to be e ective: The Monitoring Selectivity Problem. We investigate this question in the context of detecting failures in teams of cooperating agents, via SociallyAttentive Monitoring, which focuses on monitoring...
متن کاملA Dynamic Group Management Framework for Large-scale Distributed Event Monitoring
Distributed event monitoring is an important service for fault, performance and security management. Next generation event monitoring services are higly distributed and invovling a large number of monitoring agents. In order to support scalabel event monitoring, the monitoring agents use IP multicasting as a group communication for exchanging events and control information. However, dueto the d...
متن کاملChapter 16 TÆMS : A Framework for Environment Centered Analysis & Design of Coordination Mechanisms
The design of coordination mechanisms for groups of computational agents, either interacting with one another or with people, depends crucially on the task environment of which they are a part. Such dependencies include the structure of the environment (the particular kinds and patterns of interrelationships that occur between tasks) and the uncertainty in the environment (both in the a priori ...
متن کاملFault-tolerant Mobile Agent-based Monitoring Mechanism for Highly Dynamic Distributed Networks
Thanks to asynchronous and dynamic natures of mobile agents, a certain number of mobile agent-based monitoring mechanisms have actively been developed to monitor large-scale and dynamic distributed networked systems adaptively and efficiently. Among them, some mechanisms attempt to adapt to dynamic changes in various aspects such as network traffic patterns, resource addition and deletion, netw...
متن کامل